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Audio level control 



The present invention relates to conirolling multiple audio levels. More in 
particular, the present invention relates to a device for controlling the sound levels of a group 
of audio channels which can be rendered simultaneously. 

In modem communication devices, such as television sets, it is often possible 
to render two or more audio channels simultaneously. A television set may, for example, be 
able to provide a "spUt-screen" arrangement in which the television screen is divided into two 
or more sections, each section displaying a different video channel. The corresponding audio 
channels may be rendered using different loudspeakers. 

The sound level of these audio channels must be controlled in such a way that 
the viewer is able to Usten to one or more channels, changing the sound level of the channels 
when a commercial break starts or when a particularly interesting topic or video item is 
announced. 

United States Patent US 6,590,618 discloses a method and apparatus for 
changing a channel or varying a volume (sound) level of a television receiver having both a 
normal screen mode function and a multiple screen mode function. A remote control unit has 
a separate set of sound level keys for each of the multiple screens. Although tiie screen which 
would be shown in single screen mode is labeled "main picture", the sound levels associated 
witii the multiple screens are independentiy controlled by the user. When ttie user wants to 
Usten more closely to one of the multiple channels, (s)he has to both increase the sound level 
of that channel and/or reduce the sound level of die at least one other channel manually, 
using the separate control keys. It will be clear that this is inqMractical. 



It is an object of the present invention to overcome these and other problems 
of tiie Prior Art and to provide a device for controlling the sound levels of a group of audio 
channels which is easier to use and &cilitates the user control task. 

It is another object of the present invention to provide an audio system 
comprising such a device for cantix>lling the sound levels of a group of audio channels. 



PHNL040391EPP 



2 08.04.2004 
Accordingly, the present invention provides a device for cantrolUng the sound levels of a 
group of audio channels comprising a main channel and at least one auxiliary channel which 
can be rendered simultaneously, the device comprising: 

- user controlled selection means for selecting the main channel, and 

- automatic level adjustment means for adjusting the sound level of the at least 
one auxiliary channel relative to the main channel. 

By providing, in accordance with the present invention, automatic level adjustment means for 
adjusting the sound level of the at least one auxiliary channel relative to the sound level of 
the main channel, there is no need for tiie user to control the sound level of the auxiliary 
channel(s). 

In typical embodiments, the device of the present invention comprises user 
conti-olled level adjustment means, hereinafter called first level adjustinent means, for 
a^usting the sound level of the main channel. As the automatic level adjustinent means, 
hereinafter called second level adjustinent means, are arranged for adjusting flie auxiliary 
channels), the user has to contirol only a single channel, the main chamiel, in order to obtain 
a suitable overaU sound level. This reduces botii the amount of effort required by the user and 
tiie number of required keys on the (remote) control unit hi addition, tiie respective sounds 
levels of tiie channels may be weighted and/or mutually adjusted using various suitable 
techniques, hi this way the interference of tiie various audio channels as experienced by the 
user may be sigmficantiy reduced. 

It is noted that tiie user contixilled (first) level adjustinent means mentioned 
above are not essential and tiiat embodiments of tiie device of tiie present invention can be 
envisaged in which tiie sound level of tiie main channel is fixed, or is contirolled by level 
contiiol means external to said device. Altiiough tiie (second) level adjustinent means for 
adjusting tiie sound level of tiie auxiliary channels) are specificaUy referred to as being 
automatic, tiie user conti»lled (first) level contirol means may in certain embodiments also 
provide automatic level adjustment in addition to user a^ustment. 

It is fiirther noted tiiat the audio channels mentioned above may be part of 
communication channels containing audio (sound), video (moving images), pictiires (still 
images), text, and/or otiier content items. The present invention is particularly suitable for 
television (combined video and audio channels) but is not so limited and may also be apphed 
in systems providing audio only. 

The user controlled selection means allow a user to select one rendered (for 
example shown and/or played) channel as tiie main channel, aU otiier rendered channels are 
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designated aoxfliary channels. The user wiU typicaUy select as the main channel the channel 
which (s)he finds the most interesting to listen to. 

In a preferred embodiment the selection means are arranged for selecting 
successive available channels in response to user input. This allows the user to step through a 
succession of available channels using only a single (hardware or software) button or key. 
Alternatively, or additionally, a pluraHty of buttons could be provided, one for each channeL 

The first level adjustment means may be controlled by conventional sound 
level adjustment elements such as 'Volume np" and "volume down" buttons on a (remote) 
control unit. The second level adjustment means are automatic in that tiiey do not necessarily 
require user control but adjust the level(s) of tiie auxiUary channels) in response to, for 
example, changes in the sound level of tiie main channel or an auxiUaty channel. Although a 
virtually infinite number of diflEerent level(s) of tiie auxiliary chaimel(s) could be provided, it 
is preferred that tiie second level adjustment means provides a pluraUly of pre-set relative 
sound levels. In tiiis way is it possible to quickly and convenientiy step tiirough a number of 
levels. These pre-set levels may differ in absolute and/or relative terms, where relative is 
multipUcative witii respect to tiie level of tiie main channel. The said plurality of pre-set 
levels may be per channel and/or per user, hi tiie latter case, each user may be provided witii 
an individual series of pre-set levels. 

The pre-set levels ate preferably factory^set, but in an advantageous 
embodiment ttie pre-set relative sound levels may be altered by tiie user. This allows tiie 
sound levels of flie auxihary channels to be adapted to tiie user's preferences and/or hearing. 

In a particularly advantageous embodiment, the second level adjustment 
means are arranged for adapting tiie respective sound levels to flie content of each associated 
audio channel and/or to tiie sound source. That is, different levels or sets of levels maybe 
appUed, depending on wheflier tiie channel renders music, speech, or otiier audio content, and 
whetiier tiie sound source is cable, antenna, VCR (Video Cassette Recorder), DVD (Digital 
Video Disc) player, or any otiier source. Speech cottid be played louder than music, or vice 
versa, and sound originating fixjm a cable source could be amplified while sound originating 
firom a VCR could be attenuated. A detector could be provided for detecting sound 
characteristics of tiie audio content, and/or changes in associated video content, for example 
by motion or color analysis. Such detectors are knownper jc, an exemplary speech detector 
is disclosed in United States Patent US 5,878,391. In particularly advantageous further 
embodiments, different levels or sets of levels are used for different types of music, such as 
classical, pop, folk, and hard rock. 
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In a particularly advantageous embodiment of the device of the present 
invention, the level adjustment means are aixanged for adapting the respective sound levels to 
user preferences regarding the content of the channels. That is, user preferences with respect 
to content (movies, news items, commercials, etc.) may be stored and used to choose desired 
5 sound levels when such content is rendered. 

It is possible 1» ejctract information fixjm the channels mdicating the type of 
music bemg provided by the channel, whereby the second level adjustment means can 
automatically set the correspondmg level(s). Such information could be provided by the 
channels as meta-data, that is data describing the content of the chatmels, or could be derived 
10 fiiom the content itself, using a suitable detector as mentioned above. 

In aparticulaiiy advantageous embodiment, the second level adjustment 
means are arranged for ad^ting the respective sound levels to the signal characteristics of 
each associated audio channel. That is, the second level adjustment means of this 
embodiment are responsive to the signal characteristics and adjust the signal level(s) 
accordingly, hi a particularly advantageous embodiment, for example, the second level 
adjustment means are arranged for speech detection. More in particular, the second level 
adjustment means may further be arranged for formaut detection, prosody detection and/or 
keyword detection. This allows inteUigent software to change the sound level when a news 
item or movie begins, for example. 

It is noted that the device of the present invention preferably takes the signal 
characteristics of aU channels into account, including the main channel, and adjusts the 
level(s) of the auxiliary channel(s) in response thereto. 

In a particularly advantageous embodiment, the level adjustment means are 
arranged for tanporarily adjusting the sound level of a channel in response to the content 
25 and/or signal characteristics of at least one channel. That is, the sound level may be raised for 
a duration of approximately one second or several seconds to alert the user to a particular 
content item, for example an announcement containing a certain key word, or a particular 
type of signal, such as speech. It is preferred that the raised sound level gtaduaUy reverts to 
its original state, hi some embodiments both the increase and the decrease of the sound level 
are gradual. While the sound level of the channel containing the content item of interest may 
be temporarily raised, the sound levels of the other channel(s) being rendered may be 
tenqjorarily lowered during the same time duration so as to make the content item concerned 
more audible. 
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In any of the embodiments mentioned above, the level(s) may be adjusted by 
merely pioportionaUy adjusting the volume, for example by multiplying the audio signal with 
a gain fector which can be larger or smaUer than 1 . Additionally, or alternatively, the second 
level adjustment means may be arranged for cUpping and/or ffltering audio signals contained 
5 in the channels, preferably using "inteUigenf ' cUpping and/or filtering techniques. The audio 
signal level(s) may be compressed and/or limited (cUpping) or may be adjusted in 
dependence of the particular frequencies of the signal (filtering). It will be understood that 
these techniques may be combined to achieve any desired level adjustment. 

The various audio channels may be rendered by a single, common transducer, 
10 such as a loudspeaker. It is preferred, however, that the main channel and the at least one 

auxiliary channel are rendered by different transducers. This allows a spatial separation of the 
audio channels, thus making them easier to distinguish. 

In a preferred embodiment, the main channel is rendered by a transducer 
which is centraUy located with respect to the audio system of which it is part. This allows the 
15 main channel to be heard clearly and distinctly, in particular when the auxiUary channels are 
rendered by non-centrally located transducers, for example transducers located to the side(s) 
of an apparatus. 

A particularly flexible embodiment of the device of tiie present invention is 
further provided with transducer selecting means for selecting one or more transducers which 
20 render the main channel and the auxiliary channel(s) respectively. 

It will be dear to those skilled in the art that the above features of the device 
according to the present invention may be present in isolation or in combination. More in 
particular, any of the features discussed above may be provided in coiribination with one of 

more of the other features. 

25 The present invention further provides a remote control unit for use with the 

device as defined above, the unit comprising selection interfece components, such as buttons, 
for selecting the main channeL The remote conti^l unit of the present invention may 
advantageously further comprise a first sound level interface component^ such as a toggle 
stick, for setting a ratio of sound levels of rendered channels. Alternatively, or additionally, 

30 the remote contorol unit may further comprise second sound level interfece conaponents, such 
as knobs, for manually adjusting the sound levels of rendered channels. 

The present invention additionally provides an audio system, preferably an 
audio-visual system, comprising a device as defined above. Such an audio system may 
suitably be constitiited by a television set, a music center, or a home entertainment system 
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(which may include a personal computer). A remote control unit for use with such an audio 
system is discussed above. 

The present invention also provides a method of controlling the sound levels 
of a group of audio channels comprising a main channel and at least one auxiUary channel 
which can be rendered simultaneously, the method comprising the steps of: 

- selecting, under user control, the main channel, and 

- automatically adjusting the sound level of the at least one auxiUary channel 
relative to the main channel. 

Typically, the method of the present invention fiirther comprises the step of 
adjusting, under user control, the sound level of the main channel, although this step is not 
essential and may be omitted in some embodiments. 

The present invention fiuHier provides a computer program product for 
carrying out the method defined above. 



The present invention wiU further be explained below with reference to 
esxemplaiy embodiments illustrated in the accompanying drawings, in which: 

Fig. 1 schematically shows a first embodiment of a device according to the 
present invention. 

Fig. 2 schematically shows a second embodiment of a device according to Ihe 
present invention. 

Fig. 3 schematicaUy shows a first embodiment of a level adjustment unit for 
use in the device of Figs. 1 and 2. 

Fig. 4 schematicaUy shows a second embodiment of a level adjustment unit for 
use in the device of Figs. 1 and 2. 

Fig. 5 schematicaUy shows a third embodiment of a level adjustment unit 
according to the present invention. 

Fig. 6 schematicaUy shows a remote control unit for use with the device of Ihe 
present invention. 

Fig. 7 schematicaUy shows a home cinema system containing a device of the 
present invention. 
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The device 1 shown merely by way of non-limiting example in Fig. 1 
comprises a first level adjustment unit 1 1, a second level adjustment unit 12 and a third level 
adjustment unit 13 arranged in parallel. The three channels Chi, Ch2, and Ch3 are coupled to 
the inputs of the level adjustment units 1 1, 12 and 13 via a switching unit 14. The outputs of 
the level adjustment units 1 1, 12 and 13 are, in the exemplary embodiment shown, coupled to 
a signal addition unit 15 which, in tarn, is coiq)led to a transducer (loudspeaker) 2 for 
rendering the audio signals of the three channels. 

The channels Chi, Ch2, and Ch3 may, for example, be constituted by 
multimedia channels containing both audio and video (sub-)channels. The audio channels 
contain audio signals which are associated with respective video channels containing video 
signals that are to be rendered simultaneously. Alternatively, or additionaUy, the channels 
Chi, Ch2, and Ch3 may comprise one or more radio channels. The channels Chi, Ch2, and 
Ch3 may be transmitted via radio, cable, telephone lines, or other communication means. 

The switching unit 14, which is controlled by a selection signal Sel, connects 
15 one of the channels Chi, Ch2, and Ch3 to each of the level adjustment units 11, 12 and 13. 
The selection signal Sel, which is typically initiated by a user, selects one of the channels 
Chi, Ch2, and Ch3. The selected channel, labeled main channel MC, is fed to the first level 
adjustment unit 1 1, while the remaining channels, labeled first auxiUary channel ACl and 
second auxiUary channel AC2, ate fed to the second level adjustment unit 12 and the third 
20 level adjustment unit 1 3 re^ectively . 

Instead of the three channels shown, four or more channels may be present, 
even when only three adjustment units are provided. In some embodiments, therefore, the 
number of channels may exceed the number of adjustment units. In such embodiments, an 
additional selection signal may be used to select the rendered channels out of the available 
25 channels. 

The adjustment units 11, 12 and 13 receive control signals for adjusting the 
signal levels. The first level adjustment unit 1 1 receives a user control signal UC, while the 
second and third adjustment units 12 and 13 receive control signals that are (identical to or 
derived firom) the output signal of the first adjustment unit 1 1 . This output signal is the 
30 adjusted main channel (MC) signal. The user control signal UC typically comprises a 

numerical value (or an equivalent signal) representing a gain setting, for example a value 

ranging firom 1 to 20. 

As can be seen, the main channel MC can be adjusted under user control, 
while the auxiliary channels ACl and AC2 are adjusted in dependence of the main channel. 
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In this way, a change in flie sound level of the main channel MC may automatically result in 
changes in the sound levels of the auxiliary channels. 

Although the main channel may he exclusively user controlled, an 
embodiment can be envisaged in which the sound level of the main channel can be 
automatically adjusted in response to Ihe sound level in the auxiUaiy channels, and/or in 
response to changes (for example the sound level, the sound type and'or the content) in the 
main channel itself. The sound level of the main channel may therefore be adjusted either 
upwards (sound level increase) or downwards (sound level decrease) and need not be fixed or 
solely determined by tiie user. Accordingly, the main channel may also be primarily user 
controlled. The level adjustment units 1 1, 12 and 13 will later be explained in more detail 
with rej^ence to Fig. 3. 

The selection signal Sel and the user (level) control signal UC may originate 
fiom a (remote) control unit which is lypicaUy present in a television set or similar apparatus. 
In accordance with the present invention, such a remote control unit, an example of which is 
schematically shown in Fig. 6, comprises means for selecting the main channel. 

In the exemplary embodiment of Fig. 2, the device 1 of the present invention 
is designed for two channels. Each channel Chi, Ch2 is directly coupled to a respective level 
adjustment unit 1 1, 12. In contrast to Fig. 1, therefore, none of Ihe level adjustment units 
shown is specifically dedicated to the main channel MC and either level adjustment unit can 
be the first level adjustment unit 11 or the second level adjustment unit 12. For this purpose, 
a selection unit 1 6 is provided which is controlled by Ihe selection signal Sel. In a firat mode, 
the selection unit 1 6 feeds the user control signal UC to the first level adjustment unit 1 1 and 
feeds the output signal of the first level adjustment unit 1 1 as a control signal to the second 
level adjustment unit 12. hi a second mode, the selection unit 16 feeds the user control signal 
UC to Ihe second level adjustment unit 12 and feeds ffae output signal of the second level 
adjustment unit 12 as a control signal to the first level adjustment unit 1 1. The selection unit 
16 switohes between these two modes under control of the user control signal UC. 

As shown in Fig. 2, each level adjustment unit 1 1, 12 is connected to an 
individual transducer aoudspeaker) 2, 3. It wiU be understood that individual loudspeakers 
may also be employed in the embodiment of Fig. 1, in which case the signal addition unit 15 
WiU be deleted. It is also possible to use one transducer (or set of transducers) for the main 
channel MC and another transducer (or set of transducers) for the combined auxiliary 
channels. Alternatively, a signal addition unit (15 in Fig. 1) may be used in the embodiment 
of Fig. 2. 
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An exemplary embodiment of a level adjustment unit is schematically 
depicted in Fig. 3. The level adjustment unit 10 (which may correspond to any of the level 
adjustment units 11, 12 and 13 discussed above) comprises a controlled amplifier 17 and a 
level control unit 18. The level control unit 18 receives the signal of the channel concerned 
5 and passes it on to Ihe controlled amplifier 17 while measuring one or more characteristics of 
the signal, such as its level (ampUtude). The level control unit 18 also receives a control 
signal Cntl, for example firom the selection unit 16 shown in Fig. 2, or &om another level 
adjustment unit. This control signal may for example be the user control signal UC produced 
by the user, the audio ou^ut signal of one of the other level adjustment units, the (preferably 
10 delayed) output signal of the level adjustment unit itself, and/or a combination thereof. 

The level control unit 18 schematically illustrated in Fig. 3 may comprise 
suitable processing means for processmg the control signal and the channel signal so as to 
produce a suitable amplifier control signal for the amphfier 17. These processing means may 
advantageously conq)rise a microprocessor and an associated memory. The memory may be 
15 used to store, among other things, pre-set and user adjusted sound levels, pre-set and user 
adjusted sound ratios, user preferences regarding content, and/or other information. 

Various processing techniques maybe used. The sound levels of the auxiUary 
channels may be set to a certain percentage of the main channel, for example 20%, 30% or 
40% (these percentages may also be calculated on the basis of the total sound level produced 
20 by all channels together, in that case the main channel may, for example, be allocated 80% 

and the auxiUary diannels 20% of the total sound volume). These percentages may be pre-set 
in the fectory and may be based upon statistical user Ustening tests. Such tests may indicate 
which percentages yield a good inteUigibiUty and/or a suitable channel balance. 

The respective levels may be based vapaa a calculation of the total signal 
25 volume or signal power. For exanqjle, the signal power of the main channel may be 

calculated for a certain (typically short) time period using welt-known techniques such as 
integrating the square of the amplitude over said time period. The same calculation is carried 
out for the auxiUary channels). If a certain target ratio of the signal powers (or volumes: the 
integral of the absolute value of the signal) is given, the adjustment is carried out in such a 
30 way that the actual ratio becomes (approximately) equal to the target ratio. 

The level control unit 1 8 may therefore be provided with a division unit for 
dividing the sound (signal) levels of the level control units and determining a percentage 
(ratio). The level control unit 1 8 may further be provided with a comparison unit for 
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comparing the calculated ratio with a predetermined (that pre-set or previously altered) ratio 
and deriving a compensation signal from any deviation. 

In a particularly advantageous embodiment, the levels depend on the channel 
content and/or on the channel signal characteristics. With regard to content, different sound 
levels may be assigned to, for example, speech and music. A user wiU typicaUy want to hear 
what is being said on the main channel and may accept a certain level of background music 
of an auxiliary channel. Conversely, when music is rendered on both the main channel and 
Ihe auxiUaiy channel(s) and an auxiUaiy channel changes to speech, the user will lypicaUy 
want this auxiUary channel to be rendered louder in order to be able to hear and underatend 
the speech. Accordingly, the present invention may advantageously provide automatic 
content type detection which may distinguish between, for example speech, music, noise and 
silence. In addition, different types of music may be distinguished, for example classical 
music, hard rock, jazz, blues, ete.. 

Determining the audio content of a channel can be carried out in various ways. 
Information (so called meta-information or meta-data) on the content may be available, for 
example the RDS (Radio Data System) information which may be broadcast togeflier 4fh 
radio signals and typically indicates the broadcasting station, the artist, and other information. 
Such meta-data may also be transmitted via other communication channels, such as the 
Internet. If this information includes the type of music, it can be used to determine the type of 
the audio content and adapt the levels of the channels accordingly. Other suitable information 
that may be used is EPG (Electronic Program Guide) information and/or the so-caUed IDS 
tag of MPS. 

Altematively, or additionaUy, an indication of the audio content could be 
achieved using audio analysis such as speech detection and/or speech analysis. Speech 
analysis could, furthermore, involve prosody analysis and/or key word recognition, so that 
the device of the present invention may adapt the channel levels to user preferences. Other 
ways of determining content could be based upon the analysis of video or still images 
associated with the audio content, for example in the case of television. 

Instead of, or in addition to (amplitude) level adjustment based upon content, 
level adjustment could be carried out on the basis of signal characteristics. Signal analysis 
involving, for example, average signal amplitude and dominant frequencies (spectral 
analysis) can assist in automatically choosing a suitable level adjustment It will be 
understood that signal analysis may also assist in determining the content of the channels. 



PHNL040391EPP 

11 08.04.2004 
In the above discussion it has been assumed that the sound level adjustment of 
ttie various channels involved gain acyustment, that is. Hie signal of the channel is multipUed 
with a suitable gain factor (typically smaller than 1), resulting in the desired sound level. 
Although this is a very suitable technique, the present invention is not so limited. More in 
5 particular, the sound levels of the various channels may be reduced or adjusted using other 
techniques, such as cUpping, compression and filtering. The cUpping technique, which is 
known j3er se, involves limiting the signal ampUtude to a certain threshold leveL Although 
this technique may introduce some signal distortion, it is very simple and effective. Any 
signal distortion may be significantly decreased by "soft cUpping", tiiat is cUpping in which 
10 the signal ampUtude above the threshold value is (proportionaUy) reduced by multiplying tiie 
signal with a fector instead of "cutting ofiE". 

Another suitable technique which is known /?erse is filtering, which aUows 
the signal ampUtude to be reduced in dependence of the fisquency. Using filtering, specific 
ftequency ranges of the audio channels can be selectively reduced, instead of, or in addition 
15 to, adjusting the overaU level of the channeL In this way it is possible to reduce the sound 
level in accordance with the sensitivity of tiie human ear: certain firequency ranges which 
cause more perceptual interference of simultaneously rendered sound (audio channels) could 
be reduced mpre than others firequency ranges. 

The sound level as experienced by the human ear is not only determined by 
20 the actual signal power of the sound but also by psychological fectors. This phenomenon can 
advantageously be used to provide "inteUigentf' sound level processing, as in the exemplary 

embodiment of Fig. 4. 

The sound level adjustment unit 10 of Fig. 4, which may correspond to any of 

the units 1 1, 12 and 13 of Figs. 1 and 2, comprises in the exen^plary embodiment shown a 

25 first temporal analysis (TEMPAN) module 102, a compressor (COMP) 104, a multi-band 

spUttser (MULTIBAN) 106, a first fest Fourier transform (FFT) unit 108, a signal dependent 

filter (SDF) 1 10, a multi-band filter (MBFIL) 1 12, an (optional) controUed ampUfier 115, a 

critical band filter (CRITBF) 1 16, a switch 118, a second temporal analysis module 

(TEMPAN) 120, a second fast Fourier transform (FFT) unit 122, and a parameter setting 

30 (PARS) unit 124. The level adjustment unit 1 0 receives type information fixjm a type 

information unit (TPY) 130. 

The first input signal of the unit 10 is the audio signal of an auxiUary channel 
ACl or AC2, the perceptive sound level of v*ich is to be reduced (it is noted that reducing 
the sound level of an auxiUaiy channel is substantially equivalent to increasing the sound 



PHNL040391EPP 



10 



15 



12 08.04.2004 
level of the main channel as in both cases the relative sound level of the auxiliaiy channel is 
decreased). 

The sound signal of the auxiUaiy channel ACl or AC2 is temporaUy analyzed 
by the temporal analysis unit 102: the history of the signal is determined for a certain time 
period or "time sUce", for example ranging from tl to t2. The temporal analysis unit 102 is 
arranged for activating the other components of lie unit 10 only for certain time periods. If 
there is a peak in the sound level (as schematicaUy depicted in Fig. 4) between tl and t2, it 
may be advantageous to process the audio signal only for this time slice, leaving the 
remainder of the signal level unchanged. This may typically occur during a commercial break 
in a television program, in which case so-called commercial detection by other means may 
help in automaticaUy selecting the lowest of a set of user ratio preferences, for example a 
ratio of 100/0 (that is, main channel 100%, auxiliary channel 0%). 

Subseqaentiy, the audio signal may be compressed in its entirety, or for certain 
time sUces only, by the compressor 104, thus in^josing a compression relationship on Ihe 
input levels to obtain the desired oulput levels. The compressor may optionally be controlled 
by Ihe output signal of parameter setting unit 124, which in tum is derived from the control 
signal Cntl. 

The compressor 104 may use any suitable compression technique. In the 
example shown, a compression function is used which may be defined mathematicaUy as: 
20 So = Si / K(Si) + B(Si) , 

where Si is the input signal of the confessor 104 (in the present example, the audio signal of 
an auxiUary channel ACl or AC2), Sq is the output signal of the compressor 104, K is a 
scaling fector which may depead on the input signal Si and B is an addition factor which may 
also depend on the input signal Si. Typically, K increases as Si increases, thus compressing 
high amplitude signals more tiian low amplitude signals. Compression techniques are weU 
known in the art and the particular conq)ression technique used is not essential to the present 
invention. 

After being compressed, the audio signal may be filtered, for example using 
the multi-band spUtter 106 and the relatively simple multi-band filter 1 12 which is arranged 
for filtering tiie signal per frequency band. The multi-band filter 1 12 may be provided with 
arnphfiers 1 13 for each frequency band. 

The filter characteristics of the multi-band filter 1 12 may be fixed, however, 
they can also be dependent on type information provided by a type determining unit 130 



25 
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arranged for determining the type of liie audio signal, for example pop music or classical 
music. In this way, the bass of pop music may be reduced by adjusting the gain of the 
respective amplifier 1 13 for the low frequency bands. 

Additionally, or alternatively (as indicated by the switch 1 1 8), a Fourier 
transform may be calculated by the first Fourier transform unit 108, followed by fixed or 
adaptive frequency domain filtering by the frequency domain filter 110, which is, in die 
embodiment shown, a signal dependent filter. The filter 1 10 may be adapted in response to an 
adaptation signal which is derived fixjm the control signal Cntl (see also Fig. 3), which may 
be the (level adjusted) audio signal of the main channel, as discussed above with reference to 
10 Figs. 1 and 2. This adaptation signal is derived from tiie control signal Cntl using the second 
temporal analysis unit 120, the second fast Fourier transform unit 122 and the parameter 
setting unit 124. The filter 110 may for example suppress the (higher) ampUtudes of audio 
frequency components in the channel ACl or AC2 in a typical frequency band or in a 
frequency band which is actually determined by measurements. The output signal of the filter 
15 1 10 is fed to an (optional) amplifier 1 15 which serves to compensate for any decrease in the 
(average) signal level caused by the filter 110. The output signal of Ibe anq»lifier 115 (or, if 
the amplifier 1 15 is not present, the output signal of the filter 1 10) is fed to the switeh 1 18 to 
be selectively coupled to the oui^ut of the unit 10. 

It is noted that it may be advantageous to use a critical band filter 116 that has 
20 a filtering characteristic modeled in accordance with the human auditory system. The audio 
signals may, for example, be spUt up in accordance with the well-known critical band theory, 
and the audio levels of the auxiliary channels (and/or main channel) are changed in 
dependence on their (mutual) interferences in each critical band. 

The exemplary embodiment of a level adjustment module 15 schematically 
25 shown in Fig. 5 comprises an inteUigibility improvement unit (ESmMP) 152, an envelope 
unit 154, a topic detector (TOPDET) 156, a prosody analyzer (PROS) 158 and a keyword 
detector (KEYWD) 160. The keyword detector 160 receives relevant keywords from a 
keyword database (KEYDAT) 190 which may be external to the unit 10. The level 
adjustment unit 15 may represent any of the level adjustment units 11, 12 and 13 shown in 
30 Figs. 1 and 2. However, in contrast to the level adjustment units 10 of Figs. 3 and 4, the unit 
15 of Fig. 5 does not have a control irput (Cntl in Fig. 3). The unit 15 of Fig. 5 as shown is 
therefore suitable for embodiments of flie device according to the present invention in which 
user control (UC in Fig. 1) is not required or is achieved by other means. However, the 
control signal Cntl (see Fig. 3) could be fed to the inteUigibiHty improvement unit 152. 
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Alternatively, or additionaUy, the control signal Cntl could be fed to the topic detector 156, 
the prosody analyzer 158 and a keyword detector 160, instead of or in addition to the sound 
signals of the channels MC, ACl and/or AC2. 

The unit 15 is also suitable as an additional level control unit, arranged in 
series with a unit 1 0 of Fig. 3 or 4, for improving the intelligibility of speech before or after 
adjusting its level. 

As illustrated in Fig. 5, tiie main channel MC and the auxiUary channels ACl 
and AC2 (if present) may be level adjusted by the unit 15, resulting m level adjusted channels 
MC',ACl'andAC2'. 

The audio signal contained in the channel (MC, ACl, AC2) of interest is 
improved by the intelligibmiy in^ovement module 1 52 by increasing tiie amplitudes of 
formants (as schematically illustoted in Fig, 5) or by other signal processing techniques 
known in the art. An envelope unit 154 then adjusts the envelope E of the improved audio 
signal. More in particular, the envelope unit 154 is preferably ananged for temporarily 
> changing (increasing or decreasing) the envelope of tiie audio signal. To fliis end, the 

envelope unit 154 is provided witii a contiroUed amplifier or equivalent means for adjusting 
the gain of the signal. 

A preferred embodiment of the envelope unit 154 is arranged for increasing 
the sound level at certain moments of interest. These moments may be detected by a topic 
detector 156 which may be arranged to analyze tiie audio and video content of flie channel 
and detect certain features, for example pauses in speech, pauses in motion, end of a video 
insertion, and reappearance of a central character (such as a news reader) by fece detection 
and similar techniques. 

A prosody analyzer 1 58 is provided for analyzing tiie prosody in the speech 
and enhancing flie proso<fy by sending a suitable prosody enhancement signal to envelope 
unit 154. 

Keywords may also be detected using the keyword detector 160 and tiie 
associated keyword database 190. The keyword database 190 is updated using, for example, 
EPG (Electoonic Programming Guide) information summarizing tiie topics of television 
programs, and/or monitoring tfie interests of a user. If tiie rendering of a channel is suitably 
delayed, detected keywords may be rendered louder, tiius alerting tiie user to tiiese words. 

The exemplary remote control unit 6 shown in Fig. 6 comprises a toggle stick 
61, a channel selection button 62, a channel change button 63, (optional) auxiliary channel 
adjustnent knobs 64 and 65, a ratio adjustnent assembly 66 comprising four buttons 67, and 
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a screen 68 on which a channel overview 69 is displayed. In the example shown, the remote 
control unit 6 operates a television set and/or a home video system. 

Channel change hutton (key) 63 allows the channels heing rendered to be 
changed by the user. In a typical embodiment, depressing fbs channel change button 63 
repeatedly wiU result in the displayed channels to '*rotate", that is, to displayed in succession. 
Depressing the selection button (key) 62 selects ihe current channel (NLl in the example 
shown) as the main diannel. This selection of the main channel will generate a selection 

signal (Sel in Figs. 1 and 2). 

Alternatively, a keypad (not shown) may be provided to enter a channel 
number of a first channel to be rendered, depressing the channel selection button 62 will 
select this rendered channel as the main channel and generate the selection signal, after which 
any further channel number entered in the keypad will display Ihe (first) auxiUary channel 
(SBS6 in the example shown). It will be understood that the manner in which two or more 
channels out of a plurality of channels are chosen is not essential to the present invention. 

In the embodiment shown, the sound level ratio of the main channel and the 
auxiUary channels) may be adjusted by the user. To this end, the toggle stick 61 , which 
essentiaUy is a switch that can be moved fix)m a central neutral position to either a left active 
position or a right active position, is arranged in such a way the user can step through a 
HiMhber of ratios. Assuming an initial situation in which the (factory set or programmed) ratio 
of the sound levels is 70/30 (that is, main channel 70% of total sound level, auxiUary channel 
30%), moving the toggle stick to the left once may change the ratio into 80/20, and doing this 
twice may result in a ratio of 90/10. Similarly, moving the toggle stick to the right once may 
change the ratio firom 70/30 into 60/40. The remote control unit may be arranged such that 
the ratio cannot exceed a threshold value, for example 50/50. 

The remote control unit and/or the level adjustment units of the present 
invention may advantageously be designed such that activating the toggle stick 61, or any 
equivalent sound level inlerfece con^onent, causes a tenqjorary balance adjustment which 
lasts for a duration of, for example, approximately one second or several seconds, after which 
the sound levels revert to their previous values. 

In some embodiments using the toggle stick 61 may provide an alternative 
way of selecting the main diannel, producing a selection signal Sel when the ratio reaches 
40/60, for example. TTiose skiUed in the art wiU understand that various alternative or 
complementary arrangements are possfljle and that the toggle stick 61 is a useful but optional 
feature of the remote control unit 6. 
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Each newly selected ratio may be displayed, for exaniple on a screen of the 
remote control unit or on the screen of an associated television set. Alternatively, an axital 
indication could be provided, for example an audible signal produced by a signal generator or 
a speech generator. 

An alternative way of adjusting the ratio of the rendered channels is provided 
by (optional) auxiliary channel adjustment knobs 64 and 65. Rotating each of these knobs 
causes the level of the respective auxiliary channel to be adjusted. This manual adjustment is 
m addition to the automatic adjustment provided by the present invention. Embodiments can 
be envisaged in which the automatic adjustment overrides the manual adjustment or vice 
versa. 

A further optional feature of the remote control unit 6 is the ratio adjustment 
assembly 66 which comprises four buttons 67. These buttons may serve to manually adjust 
sound levels (and/or sound level ratios) m the respective channels and to choose the channel 
to be adjusted. 

It wiU be understood that the main channel selection bu«»n of a remote control 
unit according to the present invention is typically distinct from the usual channel selection 
buttons of a remote control which merely serve to select a channel to be rendered. The main 
channel selection button (or its equivalent) determines which channel of the channels being 
rendered simultaneously is to be the main channel, that is the channel under direct user 
control, in contrast to the auxihary channels the levels of which are automatically controlled 
relative to the main channel. 

It wiU further be understood that instead of using a remote control unit, other 
controls are possible. The device of the present invention could, for example, be provided 
with a speech command interpreter. 

As discussed above, the level adjusted audio channels can be rendered using a 
single, common transducer (such as a loudspeaker) or set of transducers reproducing summed 
signals, or using individual transducers or sets of the transducers for one or more channels. 
The main channel is preferably rendered using a separate transducer or set of transducers (it 
will be understood that a set of transducers may comprise, for example, a woofer and a 
tweeter, or other combinations of loudspeakers, resonators and/or other transducers), hi a 
particularly advantageous embodiment, the main channel is rendered using a centraUy located 
transducer (or set of transducers), while the auxiliary channels) is/aie rendered using 
lateraUy located transducers (or sets of transducers). Such an arrangement is schematicaUy 
shown in Fig. 7, where a television set 9 is provided with a centrally positioned loudspeakw 2 
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for readering the main channel and four laterally positioned loudspeakers 3, 3% 4 and 4' for 
rendering tiie auxiliary channels). The television set 9 is further provided with a device (1 in 
Figs. 1 and 2) according to the present invention. In the example shown, the television set has 
a screen 8 that is divided in two parts which are schematically indicated I and H. Each part is 
assigned a channel comprismg both audio and video. As mentioned above, it is preferred that 
the central loudspeaker 2 renders the sound of the main channel which is displayed in, for 
exan^le, screen section I, while flie lateral loudspeakers 3 and 4 render the sound of tiie 
auxiliary channel, the video of which is in this example rendered by screen section H. 

The television set 9 shown in Fig. 7 is part of a home cinema system which 
further comprises a set-top box 7 and stand-alone loudspeaker units 3' and 4'. 

The present invention is based upon the insist ftat the sound levels of several 
audio channels which can be rendered simultaneously shouldbe controlled intetdependentiy: 
adjusting the sound level of one dbannel may require the adjustment of one or more other 
channels. The present invention benefits from the fiirther insight that user contixjl of multiple 
channel is feciUtated if the user has to control tiie sound level of a single, main channel only, 
all other channels being controUed automatically in dependence of the main channel. 

The term computer program product should be understood to include any 
physical realization, e.g. an article of manufacture, of a collection of commands enabling a 
processor -generic or special purpose-, after a series of loading steps to get the commands 
into the processor, to execute any of the characteristic functions of an invention, hi particular 
the computer program product may be realized as program code, processor adapted code 
derived from tiiis program code, or any intermediate translation of this program code, on a 
carrier such as e.g. a disk or other plug-in component, present in a memory, tenqwrarily 
present on a network connection -wired or wireless- , or program code on paper. Apart from 
program code, invention characteristic data required for the program may also be embodied 

as a coir^uter program product 

It is noted that any terms used in this document should not be constiiied so as 
to limit the scope of the present invention. In particular, tiie words "comprise(s)" and 
"comprising" are not meant to exclude any elements not specificaUy stated. Single (circuit) 
elements may be substituted witii multiple (circuit) elements or with their equivalents. 

It will be understood by those skilled in the art that the present invention is not 
limited to the embodiments iUustrated above and tiiat many modifications and additions may 
be made without departing from the scope of the invention as defined in tiie appending 
claims. 




I 



PHNL040391EPP 

18 08.04.2004 

CLAIMS: 



1 . A device (1) for controlling the sound levels of a group of audio channels 
comprising a main channel (MC) and at least one auxiliary channel (ACl) which can be 
rendered simultaneously, the device comprising: 

- user controlled selection means (14, 16) for selecting the main channel, and 

- automatic level adjustment means (12, 13) for adjusting the sound level of 
the at least one auxiliary channel relative to the main channel. 

2. The device according to claim 1, wherein the selection means (14, 16) are 
arranged for selecting successive available channels in response to user input (Sel). 

3. The device according to claim 1 or 2, wherein the level adjustment means (12, 
13) are arranged for providing pre-set relative sound levels. 

4. The device according to claim 3, further arranged for altering the pre-set 
relative sound levels by the user. 

5. The device according to claim 1, wherein the level adjustment means (12, 13) 
are arranged for adapting the respective sound levels to the content of each associated audio 
channel (ACl, AC2). 

6. The device according to claim 5, further arranged for adapting the respective 
sound levels to user preferences regarding the content. 

7. The device according to claim 1, wherein the level adjusttnent means (12, 13) 
are arranged for adapting the respective sound levels to the signal characteristics of each 
associated audio channel (ACl, AC2). 



8. The device according to claim 7, whereia the level adjustment means (12, 13) 

are arranged for speech detection. 
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9. The device according to claim 8, wherein the level adjustment means (12, 13) 

are further arranged for speech analysis. 

5 10. The device according to claims 5 and 7, wherein the level adjustment means 

(12, 13) are arranged for temporarily adjusting the sound level of a channel (MC, ACl, AC2) 
in response to the content and/or signal characteristics of at least one channel (MC; ACl). 

1 1 • The device according to claim 1 , wherein the level adjustment means (12, 13) 

10 are arranged for gradually adjusting the sound level. 

12. The device according to claim 1, wherein the level adjustment means (12, 13) 
are arranged for clipping, compressing and/or filtering audio signals contained in the 
channels. 

15 

13. The device according to claim 1 , wherein the main channel (MC) and the at 
least one auxiliary channel (ACl) are rendered by different transducers (2, 3, 4). 

14. The device of claim 13, further provided with transducer selecting means for 
20 selecting a transducer (2, 3, 4) which renders the main channel (MC) and/or the at least one 

auxiliary channel (ACl, AC2). 

15. A level adjustment means (1 1, 12, 13) for use in the device according to claim 
1. 

25 

16. A remote control unit (6) for use with the device according to claim 1 , 
comprising selection interface components (62, 63), such as buttons, for selecting the main 
channel (MC). 

30 17. The remote control unit according to claim 16, further comprising a first sound 

level interface component (61), such as a toggle stick, for setting a ratio of sound levels of 
rendered channels (MC, ACl, AC2). 
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18. The remote control unit according to claim 16 or 17, further comprising 

second sound level interface components (64, 65), such as knobs, for manually adjusting the 
sound levels of rendered channels (MC, ACl, AC2). 

5 19. An audio system comprising a device (1) according to claim 1 . 

20. A home entertainment system comprising a device (1) according to claim 1 . 

21. A television system (9) comprising a device (1) according to any of claim 1. 

10 

22. The system according to claim 19, 20 or 21, wherein the main channel (MC) is 
rendered by a transducer (ACl ) which is located centrally relative to the system. 

23. A method of controlling the sound levels of a group of audio channels 

15 comprising a main channel (MC) and at least one auxiliary channel (ACl) which can be 
rendered simultaneously, the method comprising the steps of: 

- selecting, under user control, the main channel, and 

- automatically adjusting the sound level of the at least one auxiliary channel 
(ACl) relative to the main channel. 

20 

24. The method according to claim 23, wherein the sound level of the at least one 
auxiliary channel (ACl; AC2) is set using a plurality of pre-set relative sound levels. 

25. The method according to claim 23, wherein the respective sound levels of the 
25 at least one auxiliary channel (ACl ; AC2) are adapted to the content of each associated audio 

channel. 

26. The method according to claim 23, wherein the respective sound levels of the 
at least one auxiliary channel (ACl; AC2) are adapted to the signal characteristics of each 

30 associated audio chaxmel. 

27. The method according to claim 25 or 26, wherein speech detection is used. 
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28. The method according to claim 27, wherein fortnant detection, prosody 
detection and/or keyword detection is used. 

29. A computer program product for carrying out the method according to claim 
22. 
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ABSTRACT: 



A device (1) is arranged for controlliiig the sound levels of a group of audio 
channels including a user selected main channel (MC) and at least one auxiUary channel 
(ACl; AC2). The audio channels can be rendered simultaneously. The device comprises 
automatic level adjustment means (12, 13) for adjusting the sound level of the at least one 
auxiliary channel relative to the main channel. The level adjustment means (12, 13) may be 
arranged for adapting the respective sound levels to the content or signal characteristics of 
each associated audio channel. 
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